(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
25 June 2009 (25.06.2009) 




PCT 



(10) International Publication Number 

WO 2009/077173 A2 



(51) International Patent Classification: 
C12N 15/09 (2006.01) 

(21) International Application Number: 

PCT/EP2008/0 10770 

(22) International Filing Date: 

17 December 2008 (17.12.2008) 



(25) Filing Language: 

(26) Publication Language: 



English 
English 



(30) Priority Data: 

61/008,249 



19 December 2007 (19.12.2007) US 



(71) Applicant (for all designated States except US): 
PHILOCHEM AG [CH/CH]; ETH Zurich, Wolf- 
gang-Pauli-Str. 10 HCI E520, CH-8093 Zurich (CH). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): NERI, Dario 
[IT/CH]; Department of Chemistry and Applied Bio- 
sciences, Swiss Federal Institute of Technology Zurich, 
Wolfgang-Pauli-Strasse 10, H C I G396, CH-8093 Zurich 
(CH). MELKKO, Samu [FI/CH]; Philochem AG, E 
T H Zurich, Wolfgang-Pauli-Strasse 10, H C I E520, 
CH-8093 Zurich (CH). MANNOCI, Luca [IT/CH]; 
Philochem AG, ETH Zurich, Wolfgang-Pauli-Strasse 10, 
HCI E520, CH-8093 Zurich (CH). BULLER, Fabian 
[DE/CH]; Department of Chemistry and Applied Bio- 
sciences, Swiss Federal Institute of Technology Zurich, 



Wolfgang-Pauli-Strasse 10, H C I G396, CH-8093 Zurich 
(CH). 

(74) Agents: WATSON, Robert et al.; Mewburn Ellis LLP, 33 
Gutter Lane, London, Greater London EC2V 8AS (GB). 

(81) Designated States ( unless otherwise indicated, for every 
kind of national protection available): AE, AG, AL, AM, 
AO, AT, AU, AZ, BA, BB, BG, BH, BR, BW, BY, BZ, CA, 

CH, CN, CO, CR, CU, CZ, DE, DK, DM, DO, DZ, EC, EE, 
EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, 
IL, IN, IS, JP, KE, KG, KM, KN, KP, KR, KZ, LA, LC, LK, 
LR, LS, LT, LU, LY, MA, MD, ME, MG, MK, MN, MW, 
MX, MY, MZ, NA, NG, NI, NO, NZ, OM, PG, PH, PL, PT, 
RO, RS, RU, SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TJ, 
TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, 
ZW. 

(84) Designated States (unless otherwise indicated, for every 
kind of regional protection available): ARIPO (BW, GH, 
GM, KE, LS, MW, MZ, NA, SD, SL, SZ, TZ, UG, ZM, 
ZW), Eurasian (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European (AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, 
FR, GB, GR, HR, HU, IE, IS, IT, LT, LU, LV, MC, MT, NL, 
NO, PL, PT, RO, SE, SI, SK, TR), OAPI (BF, BJ, CF, CG, 

CI, CM, GA, GN, GQ, GW, ML, MR, NE, SN, TD, TG). 

Declaration under Rule 4.17: 

— of invento rsh ip (Rule 4.17 (iv)) 

Published: 

— without international search report and to be republished 
upon receipt of that report 



(54) Title: DN A -ENCODED CHEMICAL LIBRARIES 



< 



o 



(57) Abstract: A method of preparing a DNA-encoded chemical library comprising members which comprise a chemical moiety, 
a linking moiety and a DNA moiety, comprising for each member the steps of: (I) coupling a initial building block to a initial 
coding single strand oligomer via a linking moiety to form a initial conjugate, the initial coding single strand oligomer comprising 
a first primer region, an initial coding region and a first annealing region; (II) optionally coupling a middle building block and 
middle coding single strand oligomer to said initial conjugate to form a middle conjugate, the coupling take place in either order, 
wherein: (a) the middle building block is coupled to the residue of the initial building block; and (b) the middle coding single strand 
oligomer comprises a middle coding region and second annealing region, and is coupled by: (i) annealing a complementary single 
strand oligomer which comprises a chemical modifier, a complementary first annealing region, a complementary middle coding 
region and a complementary second annealing region by interaction between the first annealing region and the complementary first 
annealing region; (ii) treating the conjugate formed with a DNA polymerase to elongate the initial coding single strand oligomer 
to be complementary to the complementary middle coding region and the complementary second annealing region; (iii) removing 
the complementary single strand oligomer by denaturing the DNA and capturing the complementary single strand oligomer via the 
chemical modifier; (III) coupling a final building block and final coding single strand oligomer to the initial or middle conjugate as 
appropriate to form a final conjugate, the coupling take place in either order, wherein: (a) the final building block is coupled to the 
residue of the initial building block, or may be additionally or alternatively be coupled to the residue of the middle building block (if 
present); and (b) the final coding single strand oligomer comprises a final coding region and a second primer region, and is coupled 
by: (i) annealing a complementary single strand oligomer which comprises a complementary first or second annealing region as 
appropriate, a complementary final coding region and a complementary second primer region, by interaction between the first or 
second annealing region and the complementary first or second annealing region, as appropriate; (ii) treating the conjugate formed 
with a DNA polymerase to elongate the initial coding single strand oligomer to be complementary to the complementary final coding 
region and the complementary second primer region, and to elongate the complementary coding strand to be complementary to the 
initial coding single strand oligomer. 
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DNA-ENCODED CHEMICAL LIBRARIES 

The present invention relates to methods of preparing DNA-encoded chemical libraries, libraries 
prepared by those methods, use of these libraries and particular compounds therein. 

5 

Introduction 

Identification of molecules that bind specifically to target proteins of interest is a formidable 
challenge. Technologies that facilitate the isolation of binding molecules have profound 
implications for pharmaceutical research because most drug development programs rely on the 

10 ability to isolate small organic compounds that bind to a given protein. With an aging population 
and an increased understanding of the mechanisms of disease at a molecular level, biomedical 
scientists are facing the demand for more and better drugs. Additionally, elucidation of the 
biological function of proteins will, in many cases, require access to specific ligands (an 
approach that is often termed 'Chemical Genetics' (Strausberg, R.L. and Schreiber, S.L., 

15 Science 300 (2003), 294-295). Techniques for the general, fast, inexpensive isolation of small, 
organic, binding molecules are lacking at present. 

Traditional high-throughput screening (HTS) procedures require the storage and handling of 
hundreds of thousands of chemical compounds and, typically, they rely on a biochemical assay 

20 (Keseru, G.M. and Makara, G.M., Drug Discov. Today 11 (2006), 741-748). Alternatives to 
HTS that have been introduced recently include dynamic combinatorial chemistry (Lehn, J.M. 
and Eliseev, A.V., Science 291 (2001), 2331-2332; Otto, S., et al.,Drug Discov. Today 7 (2002), 
117-125), small-molecule microarrays (Uttamchandani, M., et al., Curr. Opin. Chem. Biol. 9 
(2005), 4-13), fragment-based lead discovery (Carr, R.A., et al., Drug Discov. Today 10 (2005), 

25 987-992; Erlanson, D.A., Curr. Opin. Biotechnol. 17 (2006), 643-652) and DNA-encoded 
chemical libraries (Dumelin, C.E., et al., QSAR Comb. Sci. 25 (2006), 1081-1087; 
Scheuermann, J., et al., J. Biotechnol. 126 (2006), 568-581). DNA-encoded chemical libraries 
are collections of small organic molecules that are conjugated covalently to DNA tags that serve 
as identification bar codes. 

30 

In DNA-encoded chemical library, the organic molecule, or putative binding moiety, is linked 
physically to a DNA tag that carries the identity code of the molecule it is attached to. The 
selection procedure for DNA-encoded chemical libraries comprises incubating the library of 
encoded molecules with the target protein of choice and, after affinity capture, non-binding 
35 library members are separated from binding library members by, for example, coupling the 
target protein to a solid support and removing the supernatant (that contains the non-binders). 



WO 2009/077173 



PCT/EP2008/010770 



2 



The stringency of selection is controlled using suitable washing conditions. After elution of the 
DNA-tagged binders, the enriched DNA moieties are amplified by polymerase chain reaction 
(PCR), which allows very low amounts of template DNA to be detected. Subsequent decoding 
of the enriched DNA uses either DNA sequencing or hybridization to oligonucleotide 
5 microarrays, depending on the architecture of the library and its size. In some libraries, the DNA 
tags not only serve as identification bar codes, but can also guide the synthesis of the displayed 
molecule. 

It is probable that the concept for the construction of a DNA-encoded chemical library was 

10 proposed first by Sydney Brenner and Richard Lerner in 1992 (Brenner, S. and Lerner, R.A.. 
Proc. Natl. Acad. Sci. U. S. A. 89 (1992), 5381-5383; US 5,573,905). The authors postulated 
the alternating stepwise synthesis of a polymer (e.g. a peptide) and an oligonucleotide 
sequence (serving as an identification bar code) on a common linker (e.g. a bead) in split and 
pool cycles. After affinity capture on a target protein, the population of DNA tags of the selected 

15 library members would be amplified by PCR and, in theory, utilized for enrichment of the bound 
molecules by serial hybridization steps to a subset of the library. In principle, the affinity-capture 
procedure could be repeated, possibly resulting in a further enrichment of the active library 
members. Finally, the structures of the chemical entities would be decoded by cloning and 
sequencing the PCR products. The feasibility of the orthogonal, solid-phase synthesis of 

20 peptides and oligonucleotides was demonstrated by attaching a test peptide (the pentapeptide 
leucine enkephalin) and an encoding DNA tag onto controlled-pore glass beads (Nielsen, J., et 
a/., J. Am. Chem. Soc. 115 (1993)). The peptide bound to a specific antibody and the 
corresponding DNA coding tag was amplified by PCR. To date, the only implementation of the 
technology at a library level has been published by scientists at Affymax, who have constructed 

25 a collection of -10 6 heptapeptide sequences (based on seven amino acids) and their 
corresponding coding oligonucleotide tags on beads. The library was incubated with a 
fluorescently labelled anti-peptide antibody, and binders were selected successfully by 
fluorescent-assisted cell sorting (Needels et a/., M.C., Proc. Natl. Acad. Sci. U. S. A. 90 (1993), 
10700-10704). In the original paper, Brenner and Lerner suggested that the alternate synthesis 

30 of chemical compounds and oligonucleotides could also be implemented in the absence of 
beads. This approach has been followed by the companies NuEvolution (US 2006-0246450; 
WO 02/103008) and Praecis (WO 2005/058479) by using enzyme-catalyzed ligation of coding 
DNA blocks rather than chemical synthesis of oligonucleotides. 

35 Other methods of synthesizing DNA-encoded chemical libraries involve DNA-templated organic 
synthesis (Gartner, Z.J., et a/., Science 305 (2004), 1601-1605) and encoded self-assembly 



WO 2009/077173 



PCT/EP2008/010770 



3 

chemical libraries (Melkko, S., et a/., Nat Biotechnol. 22 (2004), 568-574; Sprinz, K.I., et a/. p 
Bioorg. Med. Chem. Lett. 15 (2005), pp. 3908-3911.) 

General Description 

5 The present inventors have developed a method of preparing DNA-encoded chemical libraries 
in which the initial coding of each building block of the chemical moiety is carried out by a single 
stranded oligomer, and where the double stranded DNA moiety in the final library members is 
completed using a DNA polymerase. 

10 A first aspect of the present invention provides a method of preparing a DNA-encoded chemical 
library comprising members which comprise a chemical moiety, a linking moiety and a DNA 
moiety, comprising for each member the steps of: 

(I) coupling a initial building block to a initial coding single strand oligomer via a linking 
moiety to form a initial conjugate, the initial coding single strand oligomer comprising a first 

15 primer region, an initial coding region and a first annealing region; 

(II) optionally coupling a middle building block and middle coding single strand oligomer to 
said initial conjugate to form a middle conjugate, the coupling take place in either order, 
wherein: 

(a) the middle building block is coupled to the residue of the initial building block; and 
20 (b) the middle coding single strand oligomer comprises a middle coding region and second 
annealing region, and is coupled by: 

(i) annealing a complementary single strand oligomer which comprises a chemical modifier, 
a complementary first annealing region, a complementary middle coding region and a 
complementary second annealing region by interaction between the first annealing region and 

25 the complementary first annealing region; 

(ii) treating the conjugate formed with a DNA polymerase to elongate the initial coding 
single strand oligomer to be complementary to the complementary middle coding region and the 
complementary second annealing region; 

(Hi) removing the complementary single strand oligomer by denaturing the DNA and 
30 capturing the complementary single strand oligomer via the chemical modifier; 

(III) coupling a final building block and final coding single strand oligomer to the initial or 
middle conjugate as appropriate to form a final conjugate, the coupling take place in either 
order, wherein: 

(a) the final building block is coupled to the residue of the initial building block, or may be 
35 additionally or alternatively be coupled to the residue of the middle building block (if present); 
and 
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(b) the final coding single strand oligomer comprises a final coding region and a second 
primer region, and is coupled by: 

(i) annealing a complementary single strand oligomer which comprises a complementary 
first or second annealing region as appropriate, a complementary final coding region and a 

5 complementary second primer region, by interaction between the first or second annealing 
region and the complementary first or second annealing region, as appropriate; 

(ii) treating the conjugate formed with a DNA polymerase to elongate the initial coding 
single strand oligomer to be complementary to the complementary final coding region and the 
complementary second primer region, and to elongate the complementary coding strand to be 

10 complementary to the initial coding single strand oligomer. 

A second aspect of the present invention provides a method of preparing a DNA-encoded 
chemical library comprising a chemical moiety, a linking moiety and a DNA moiety, comprising 
for each member the steps of: 
15 (I) coupling a initial building block to a initial coding single strand oligomer via a linking 
moiety to form a initial conjugate, the initial coding single strand oligomer comprising a primer 
region, an initial coding region and an annealing region; 

(II) coupling a middle building block and middle coding single strand oligomer to said initial 
conjugate to form a middle conjugate, the coupling take place in either order, wherein: 

20 (a) the middle building block is coupled to the residue of the initial building block; and 
(b) the middle coding single strand oligomer comprises a middle coding region and a 
restriction region, and is coupled by: 

(i) annealing a complementary single strand oligomer which comprises a complementary 
annealing region, a complementary middle coding region, a complementary restriction region 

25 and a chemical modifier, by interaction between the annealing region and the complementary 
annealing region; 

(ii) treating the conjugate formed with a DNA polymerase to elongate the initial coding 
single strand oligomer to be complementary to the complementary middle coding region and the 
complementary restriction region and to elongate the complementary single strand oligomer to 

30 be complementary to the initial primer region and initial coding region; and 

(iii) treating the conjugate with a non-palindromic restriction enzyme which acts on the 
restriction region to leave a middle conjugate, and capturing the restriction product via the 
chemical modifier; 

(III) coupling a final building block and final coding single strand oligomer to the middle 
35 conjugate as appropriate to form a final conjugate, the coupling take place in either order, 

wherein: 
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(a) the final building block is coupled to the residue of the initial building block and/or to the 
residue of the middle building block; and 

(b) the final coding single strand oligomer comprising a final coding region and a second 
primer region, and is coupled by ligating a terminal double-stranded portion comprising a 

5 ligation region that matches the remainder of the restriction region site of the middle conjugate, 
and a final coding region and second primer region, as well as the complementary regions in 
the complementary strand. 

In both the first and second aspects of the invention, it is preferred that the individual method 
10 steps are carried out at about the same time for each library member. In particular, it is 

preferred steps (I), (II) if present, and (III) follow a split and pool methodology. This well-known 
technique in combinatorial chemisty involves carrying out step (I) in an appropriate number of 
vessels for the number of building blocks, pooling the resulting initial conjugates and then 
splitting the mixture so obtained into the appropriate number of vessels for the number of 
15 building blocks in the subsequent step. This method can be repeated a third time if the method 
involves three groups of building blocks. 

A third aspect of the present invention provides a method of preparing a DNA-encoded 
chemical library comprising members which comprise a chemical moiety, a linking moiety and a 
20 DNA moiety, comprising for each member the steps of: 

(I) coupling a initial building block to a initial coding single strand oligomer via a linking 
moiety to form a initial conjugate, the initial coding single strand oligomer comprising a first 
primer region, an initial coding region and a first annealing region; 

(II) coupling a middle building block and middle coding single strand oligomer to said initial 
25 conjugate to form a middle conjugate, the coupling take place in either order, wherein: 

(a) the middle building block is coupled to the residue of the initial building block; and 

(b) the middle coding single strand oligomer comprises a middle coding region and a 
restriction region, and is coupled by: 

(i) annealing a complementary single strand oligomer which comprises a complementary 
30 first annealing region, a complementary middle coding region and a complementary restriction 

region by interaction between the first annealing region and the complementary first annealing 
region; 

(ii) treating the conjugate formed with a DNA polymerase to elongate the initial coding 
single strand oligomer to be complementary to the complementary middle coding region and the 

35 complementary restriction region; 

(iii) treating the conjugate with a restriction enzyme which acts on the restriction region to 
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leave a middle conjugate, and removing the restriction product not containing the coding 
regions with a chromatographic method; 

(III) coupling a final building block and final coding partially double stranded oligomer to the 
middle conjugate to form a final conjugate, the coupling take place in either order, wherein: 
5 (a) the final building block is coupled to the residue of the initial building block, and 
additionally or alternatively be coupled to the residue of the middle building block; and 
(b) the final coding partially double stranded oligomer comprises a final coding region, a 
second primer region, and a double stranded ligation region that matches the remainder of the 
restriction region site of the middle conjugate, and is coupled by ligation using an excess of the 
10 final coding partially double stranded oligomer. 

In the third aspect of the invention, it is preferred that the individual method steps are carried out 
at about the same time for each library member. In particular, it is preferred steps (I), (II) and 
(III) follow a split and pool methodology. This well-known technique in combinatorial chemisty 
15 involves carrying out step (I) in an appropriate number of vessels for the number of building 

blocks, pooling the resulting initial conjugates and then splitting the mixture so obtained into the 
appropriate number of vessels for the number of building blocks in the subsequent steps 

A fourth aspect of the present invention provides a library synthesised according to the method 
20 of either the first, second or third aspects of the invention. This aspect also comprises the 
particular library described below however made. 

A fifth aspect of the present invention provides the use of the library of the fourth aspect in a 
method of screening the library for a library member which shows affinity for a biological target. 

25 

Detailed Description 
Figures 

The first aspect of the invention is illustrated in figures 1 and 2. Figure 1 illustrates the invention 
where the optional middle step doesn't take place, whereas Figure 2 illustrates the invention 
30 with the optional middle step. 

The second aspect of the invention is illustrated in figure 3. 

Figure 4 shows the results of the screening of a library of the invention against streptavidin; 
35 figure 4a shows the frequency of the library members before selection; figure 4b shows the 

frequency of the library members after selection on empty beads; figure 4c shows the frequency 
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of the library members after selection against Streptavidin bead, as well as the identity of high 
frequency library members. 

Figure 5 shows the results of the screening of a library of the invention against IgG, with the 
5 identity of high frequency library members. 

Figure 6 shows the results of use of an IgG binding resin according to the invention. 

Figure 7 illustrates three embodiments of the first and second aspects of the invention, including 
10 gels resulting from the experiments described in Example 7. 

The third aspect of the invention is illustrated in Figure 8. 

Library Members 

15 Each member of the library comprises a chemical moiety, which is made from building blocks, a 
DNA moiety, which includes codes for the building blocks of the member, and a linking moiety, 
which links the chemical moiety to the DNA moiety. 

Library 

20 A library of this invention is a collection of chemical diverse library members. 

The number of different members in a library represents the complexity of a library and is 
defined by the number of building blocks in each member of the library, and by number of 
variants of each building block. The number of different members of any particular library can be 
25 determinded by multiplying the number of variants of each building block together. For 

example, if there are two building blocks, each of which have twenty variants, then the resulting 
library has 400 members. If, for example, there are three building blocks, each of which has 
twenty variants, then the resulting library has 8000 members. 

30 Because the number of different members of the library will determine the amount of a particular 
member relative the other members in the library, there are theoretical limits to the maximum 
size of a library. Therefore it is useful to consider how large a library should be. This size limit is 
dictated by the level of sensitivity for detecting the presence of the DNA moiety after the 
screening process. Detection sensitivity is dictated by the threshold of binding between an 

35 acceptor molecule to be assayed and a library member. 
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The relative amounts of the individual members within the library can vary from about 0.2 
equivalents to about 10 equivalents, where an equivalent represents the average amount of a 
member within the library. Preferably each member is present in the library in approximately 
equimolar amounts. 

5 

In a preferred embodiment, a library contains the every possible member based on the 
mathematical combinations for a given library. 

Chemical Moiety 

10 The chemical moiety is formed from the initial, middle (if present) and final building blocks. If 
there are only two building blocks, initial and final, these may be joined to one another by a 
chemical reaction (see below) and their residues will be linked by one or more chemical bonds. 

If there are three building blocks, then these can be joined in one of three ways. In a first way, 
15 the building blocks are linked in series, i.e. Initial-Middle-Final. In a second way, both middle 
and final building blocks are linked to the initial building block. In a third way, both middle and 
final building blocks are linked to the initial building block, as well as to each other other. 

Accordingly, the functionality required on each building block for linking to other building blocks 
20 will vary depending on the mode of binding desired. The initial building block will also require 
functionality so as to bind to the linking moiety. 

The reaction conditions used to link the building blocks must be compatible with the DNA moiety 
of the library members. Thus, the type of building blocks that are suitable for use in the present 
25 invention will be limited by suitable linking chemistry. 

Illustrative building blocks and reaction schemes are presented below. 

If an amino acid residue, or other amine containing moiety, (with appropriate amine protection) 
30 is the first building block, then a subsequent building block may be a carboxylic acid (or 

activated version thereof, e.g. ester, acid anhydride, acid halide). Appropriate conditions are 
shown in scheme 1, where the method is illustrated with respect to an amino acid residue: 
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P = A, Piperidine 1h, 4°C 




NHR 



Okie 



A B CD 

Scheme 1 

Thus, in some embodiments, if the first building block comprises a protected amine group, this 
5 is deprotected before being coupled with a carboxylic acid or activated form thereof. 

If the first building block comprises a double bond conjugated to an electron withdrawing group, 
then a subsequent building block being a thiol, alcohol, primary amine or enolate may be added, 
as shown in scheme 2: 




enolates 



EWG = Elctron Withdrawing Group 

Scheme 2 

Click chemistry may be used to join an azide containing moiety to a alkyne containing moiety, 
as shown in scheme 3: 
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I Oligo-N 3 + HC=C— R 



Oligo 



N 



Oligo 



N 



II Oligo— C^CH + N 3 — R 



Scheme 3 



Oligo' 



R 

/ 

N 
N 



— K I 



Ether-bond formation could be used to join a moiety comprising a hydroxy group and a moiety 
5 comprising a leaving group, such as halogen, OTf, ONf or OTs, as shown in scheme 4: 



<OH 



Oligo 



X 



R 1 = Alkyl 

^.0>^ X = halogen, OTf, ONf, OTs 

Oligo Ri 



Scheme 4 



Reductive amination could be used to link a moiety comprising a ketone group with a moiety 
10 comprising a primary amine group, as shown in scheme 5: 

R 2 



Oligo 




R2 



H 2 N 



Ri 



Oligo 




Ri 




R 2 



R2 



H 2 N 



^-Oligo 



Ri 




N 
H 



Oigo 



Scheme 5 



Suzuki cross-coupling may be used to link a boron containing moiety to a moiety with a leaving 
15 group, such as I, OTf, Br or CI, as shown in scheme 6: 

^BY 2 X Pd Catalyst 

Ri + ^ 



Oligo 



Bsae 



Oligo 



X = I, OTf, Br, CI 
Y = OH, OR 
R 1 = aromatic 
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The above schemes could also apply where the building block attached to the DNA moiety is 
the second building block, and the subsequent moiety added is the third building block, or to link 
the initial building block to the linking moiety. 

5 In one embodiment of the invention, the chemical moiety is formed from an initial and final 
building block, where the initial building block is an amino acid, and the final building block is a 
carboxylic acid. In particular, the collection of 20 initial building blocks is set out in Table 1: 
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and the collection of 200 final building blocks is set out in Table 2: 
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Accordingly the present invention provides a DNA-encoded chemical library comprising 
members which comprise a chemical moiety, a linking moiety and a DNA moiety, wherein the 
chemical moiety is the reaction product of an initial building block and a final building block, the 
5 initial building blocks being selected from table 1 and the final building blocks being selected 
from table 2 where the library comprises members representing the reaction product of at least 
three initial building blocks and at least 5 final building blocks. 

The library may comprise members representing the reaction product of at least five, ten fifteen 
10 or twenty initial building blocks and at least 10, 50, 100 or 200 final building blocks. 

Although it is preferred that the library comprises members representing every possible reaction 
product of the initial and final building blocks, in some embodiments, the library comprises 
members representing at least 75, 90 or 95% of the possible reaction products. 

15 

Linker Moiety 

The linker moiety can be any moiety that performs the function of operatively linking the 
chemical moiety to the DNA moiety. 

20 The linker moiety can vary in structure and length, and provide at least two features: (1) 

operative linkage to the chemical moiety and (2) operative linkage to the DNA moiety. As the 
nature of chemical linkages is diverse, any of a variety of chemistries may be utilized to effect 
the indicated operative linkages to both the chemical and DNA moieties, the nature of the 
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linkage is not considered an essential feature of this invention. The size of the linker moiety in 
terms of the length between the chemical and DNA moieties can vary widely, but for the 
purposes of the invention, need not exceed a length sufficient to provide the linkage functions 
indicated. Thus, a chain length of from at least one to about 20 atoms is preferred. 

5 

Typical linkers may be amino modifiers (such as 3-Amino-Modifier C3, 3'-Amino-Modifier C6 
dC, 3-Amino-Modifier C6 dT, 3'-Amino-Modifier C7, 3'-PT-Amino-Modifier C3, 3-PT-Amino- 
Modifier C6, S'-Amino-dT-CE, 5-Amino-Modifier 5, 5-Amino-Modifier C12, 5'-Amino-Modifier 
C3, 5-Amino-Modifier C6, Amino-Modifier C2 dT, Amino-Modifier C6 dA, Amino-Modifier C6 dC, 
10 Amino-Modifier C6 dG, Amino-Modifier C6 dT, Amino-Modifier C6-U, 5'-Amino Modifier C12), 
thiol modifiers (e.g. 3'-Thiol-Modifier C3 S-S, 5'-Thiol-Modifier C6, 3'-C6-Thiuol-Modifier S-S, 3'- 
C6-Thiol-Modifier, 5'-C6-Thiol-Modifier S-S, 5'-C6-Thiol-Modifier), carboxy modifiers (e.g. 3'- 
Carboxylate Photolabile C6, 5'-Carboxy-Modifier C10, Carboxy-dT), or aldehyde modifiers (e.g. 
5-Aldehyde-Modifier C2). These are readily commercially available 

15 

In the examples below, a 5'-Amino Modifier C12 was used as the linker molecule. This linker 
has the structure: 



Thus in some embodiments of the present invention, the linker moiety is a 5'-amino modifier 
20 C12. 

DNA moiety 

The DNA moeity is double-stranded and comprises in the first aspect of the invention a first 
primer region, an initial coding region, a first annealing region, a final coding region and a 
25 second primer region, as well as optionally a middle coding region and second annealing region 
between the first annealing region and the final coding region. In the second aspect of the 
invention, the DNA moiety comprises a first primer region, an initial coding region, an annealing 
region, a middle coding region, a restriction/ligation region, a final coding region and a second 
primer region. 

30 

The first and second primer regions provide a means to produce a PCR-amplified duplex DNA 
fragment derived from the library member using PCR. The typical length of primer regions is 
between 16 and 28 nucleotides (in single stranded format), or base pairs (in double stranded 




S'-Oligo 
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format). In double stranded format, one of the two DNA strands of the primer region can form a 
sequence specific dimer with a PCR primer at an appropriate hybridization temperature. Typical 
hibridization temperatures for the sequence specific hybridization of PCR primers to PCR primer 
regions is between 40 and 70°C. PCR primers can be longer than the hybridization region 
5 complementary to the primer region (at their respective 5' ends), e.g. for the introduction of 
additional sequences at the extremities of the PCR product that can be useful for later steps of 
the decoding process. In the examples below, PCR primers DEL_P1_A (5'- GCC TCC CTC 
GCGCCATCA GGG AGC TTG TGA ATT CTG G-3') and DEL_P2_B (S'- GCC TTG CCA GCC 
CGC TCA G GT AGT CGG ATC CGA CCA C-3') were used. The sequences that are not 
10 underlined, are complementary to one of the 2 strands of the PCR primer regions on the double 
stranded library members. 

Each coding region needs to contain sufficient nucleotides to uniquely identify the building block 
for which it is coding. For example, if the building block has 20 variants, the coding region 

15 needs to contain at least 3 nucleotides (4 2 = 16, 4 3 = 64). The coding region may be longer 
than necessary. The benfit of employing coding regions that are longer than necessary is that 
they provide the opportunity to differentiate codes by more than just a single nucleotide 
difference, which gives more confidence in the decoding process. In the examples below, the 
initial building block (20 compounds) was encoded by 6 nucleotides, and the second building 

20 block (200 compounds) was encoded by 8 nucleotides. 

The annealing regions need to be of a sufficient length to allow for good recognition and binding 
between the oligonulceotides. Typically, an annealing region will contain at least 10 
nucleotides, and more preferably at least 18 nucleotides. Normally, the annealing region will 
25 contain no more than 28 nucleotides. If more than one annealing region is present, they should 
have different sequences. In the examples below, the annealing region contains 18 
nucleotides. 

As with hybridization with PCR primers, typical hybridization temperatures for the sequence 
30 specific annealing of two annealing regions are between 40°C and 70°C. 

The restriction region used in the second aspect of the invention provides a site for restriction 
with a non-palindromic restriction enzyme that leaves a non-blunt (cohesive) end. This non- 
blunt end is matched by the ligation region used to couple the final coding region and second 
35 primer. The restriction region will contain preferably 6 nucleotides and normally no more then 8 
nucleotides. The non-palindromic restriction site is required to prevent any non-specific ligation 
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on the final coding step. The non-palindromic restriction enzyme, may be, for example, BssSI 
(5'- CAC GAG -3'). BmgBI (5'- CAC GTC -3'). BbvCI (5'- CCT TCA GC -3'), BseYI (5'- CCC AGC -3') 
or BsrBI (5'- CCG CTC -3'). The enzyme restriction sequences are underlined. 

5 In the examples below the restriction region contains 6 nucleotides and is specific for BssSI 
enzyme (5'- CAC GAG -3'). Thus, in embodiments of the invention the restriction region contains 
6 nucleotides and is specific for BssSI enzyme (5'- CAC GAG -3'). 

The chemical modifier used in the first and second aspects of the invention may be a biotin 
10 moiety, or an imminobiotin moiety or any suitable chemical moiety that allows a selective 
capture by a macromolecular entity immobilized on a solid support, preferably high density 
coated streptavidin sepharose resin. 

Thus, in embodiments of the invention the chemical modifier is either a 3'-Biotin-C6-modification 
15 or a 5'- Biotin-C6-modification. The modification has the structure: 




5' or 3'-Oligo 



The restriction region used in the third aspect of the invention provides a site for restriction with 
a restriction enzyme that leaves a non-blunt (cohesive) end. This non-blunt end is matched by 

20 the ligation region used to couple the final coding region and second primer. The restriction 
region will contain preferably 6 nucleotides and normally no more then 8 nucleotides. The 
restriction site is not required to be non-palindromic, as in the third aspect of the invention an 
excess of the final coding partially double stranded oligomers are used in the ligation. The 
restriction enzyme, may be, for example, BamHI (5- GGA TCC -3'). or EcoRI (5'-GAA TTC-3'). 

25 The enzyme restriction sequences are underlined. 

In example 8 below the restriction region contains 6 nucleotides and is specific for BamHI 
enzyme (5'- GGA TCC- 3'). Thus, in embodiments of the third aspect of the invention the 
restriction region contains 6 nucleotides and is specific for BamHI enzyme (5'- GGA TCC -3'). 

30 

Method steps 

Coupling initial building block to initial coding single strand oligomer 
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The initial coding strand oligomer may be derivatised by a linking moiety using the appropriate 
conditions, as are well known in the art. In many embodiments, the initial coding strand 
oligomer may be puchased with the linking moiety already attached. The initial building block is 
reacted with the reactive terminus of the linking moiety under conditions which do not affect the 
5 conjugated oligomer. Different initial building blocks are conjugated to different oligonucleotides 
that bear coding regions specific for the initial building block they are coupled to. 

Coupling building blocks 

The building blocks may be coupled to each other using, for example, the chemistries set out 
10 above. As discussed, the reaction conditions used must be suitable to be used in the presence 
of single or double stranded DNA. 

Annealing DNA 

The annealing of the various oligonucleotides described above occurs by virtue of hybridization 
15 mediated by the complementary sequences of the annealing and complementary annealing 
regions. Incubating the mixture at an appropriate hybridization temperature (typically between 
40 and 70°C) facilitates sequence specific annealing. 

Use of DNA polymerase 

20 By addition of desoxynucleotides (dNTPs) and a polymerase (e.g. Klenow polymerase) in a 
suitable buffer at the appropriate point in the method, and incubation at a suitable temperature 
(e.g. 37°C for Kenow polymerase), the DNA strands are elongated, forming a double stranded 
DNA segment. 

25 Denaturing DNA and capture via chemical modifier 

The double stranded DNA with suitable chemical modification is incubated with denaturing 
buffer (e.g. urea 4M aqueous) and eventually heated at 94°C. Following addition of the suitable 
macromolecular capturing agent immobilized on a solid support (e.g. high density coated 
streptavidin sepharose resin), the sample is incubated at an appropriate temperature, the 

30 supernatant separated by filtration and the desired single strand DNA isolated in the suitable 
buffer. 

Treatment with restriction enzyme and capture via chemical modifier 

Conjugates that are part of the second aspect of the invention containing a restriction site are 
35 treated with a restriction enzyme (e.g. BssSI) in a suitable buffer, and then incubated at the 
appropriate temperature for the enzyme (e.g. 37°C for BssSI). Two restriction fragments are 
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formed. On the one hand, a DNA fragment with the putative binding moiety conjugated to a 
DNA restriction fragments containing the coding tags, and on the other hand, a DNA restriction 
fragment with a chemical modifier, suitable for affinity capture. Subsequent addition of a suitable 
macromolecular ligand immobilized on a solid support (e.g. high density coated streptavidin 
5 sepharose resin), allows the separation by filtration of both restriction fragments. 

Treatment with restriction enzyme 

Conjugates that are part of the third aspect of the invention containing a restriction site are 
treated with a restriction enzyme (e.g. BamHI) in a suitable buffer, and then incubated at the 
10 appropriate temperature for the enzyme (e.g. 37°C for BamHI). Two restriction fragments are 
formed. On the one hand, a DNA fragment with the putative binding moiety conjugated to a 
DNA restriction fragments containing the coding tags, and on the other hand, a DNA restriction 
fragment. This DNA restriction fragment can be removed by chromatographic means. 

15 

Ligation of DNA 

Ligation may be carried out by mixing of two sets of pre-hybridized suitable pairing 
oligonucleotides with cohesive ends, and a ligase enzyme (e.g. T4 ligase) in a suitable buffer, 
and incubation at appropriate temperature (e.g. 25°C for T4 ligase). 

20 

In the third aspect of the invention, an excess of oligonucleotides is used in the ligation step. 
Uses of DNA-encoded libraries 

The libraries of this invention provide a repertoire of chemical diversity such that each chemical 
25 moiety is linked to a DNA moiety that facilitates identification of the chemical moiety. 

By the present screening methods, it is possible to identify optimised chemical structures that 
participate in binding interactions with a biological macromolecule by drawing upon a repertoire 
of structures randomly formed by the association of diverse building blocks without the 
30 necessity of either synthesising them one at a time or knowing their interactions in advance. 

The invention therefore also contemplates a method for identifying a chemical moiety that 
participates in a preselected binding interaction between the chemical moiety and a biological 
macromolecule. The chemical moiety to be identified is represented by one of the members of a 
35 library of this invention, and the method comprises the following steps: 

(a) a library according to the present invention is admixed with a preselected biological 
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macromolecule under binding conditions (i.e., a binding reaction admixture) for a time period 
sufficient for the biological macromolecule to interact with at least one member of the library 
and form a binding reaction complex; 

(b) the binding reaction complex is then isolated from the library admixture to form an isolated 
5 complex; 

(3) the sequence of the DNA moiety present in the isolated binding reaction complex is 
determined, this identifying the chemical moeity that participated in the binding reaction. 

A typical biological macromolecule exhibiting a preselected binding interaction can be any of a 
10 variety of molecules (e.g. proteins) that bind selectively to another molecule, including 

antibodies to antigens, lectins to oligosaccharides, receptors to ligands, enzymes to substrates 
and the like mediators of molecular interactions. Therefore, a preselected binding interaction is 
defined by the selection of the biological macromolecule with which a library member is to bind. 

1 5 Binding Reaction Admixtures 

The admixture of a library of the invention with a biological macromolecule can be in the form of 
a heterogeneous or homogeneous admixture. Thus, the members of the library can be in the 
solid phase with the biological macromolecule present in the liquid phase. Alternatively, the 
biologically active molecule can be in the solid phase with the members of the library present in 

20 the liquid phase. Still further, both the library members and the biological macromolecule can be 
in the liquid phase. 

Binding conditions are those conditions compatible with the known natural binding function of 
the biological macromolecule. Those compatible conditions are buffer, pH and temperature 
25 conditions that maintain the biological activity of the biological macromolecule, thereby 
maintaining the ability of the molecule to participate in its preselected binding interaction. 
Typically, those conditions include an aqueous, physiologic solution of pH and ionic strength 
normally associated with the biological macromolecule of interest. 

30 For example, where the binding interaction is to identify a member in the library able to bind an 
antibody molecule, the preferred binding conditions would be conditions suitable for the 
antibody to immunoreact with its immunogen, or a known immunoreacting antigen. For a 
receptor molecule, the binding conditions would be those compatible with measuring receptor- 
ligand interactions. 

35 

A time period sufficient for the admixture to form a binding reaction complex is typically that 
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length of time required for the biological macromolecule to interact with its normal binding 
partner under conditions compatible with interaction. Although the time periods can vary 
depending on the molecule, admixing times are typically for at least a few minutes, and usually 
not longer than several hours, although nothing is to preclude using longer admixing times for a 
5 binding reaction complex to form. 

A binding reaction complex is a stable product of the interaction between a biological 
macromolecule and a bifunctional molecule of this invention. The product is referred to as a 
stable product in that the interaction is maintained over sufficient time that the complex can be 
10 isolated from the rest of the members of the library without the complex becoming significantly 
disassociated. 

Isolation of a Library Member from the Binding Reaction Admixture 

A binding reaction complex is isolated from the binding reaction admixture by any separation 
15 means that is selective for the complex, thereby isolating that library member, or members, 
which hase/have bound to the biological macromolecule. There are a variety of separation 
means, depending on the status of the biological macromolecule. 

For example, the biological macromolecule can be provided in admixture in the form of a solid 
20 phase reagent, i.e., affixed to a solid support, and thus can readily be separated from the liquid 
phase, thereby removing the majority of library members. Separation of the solid phase from the 
binding reaction admixture can optionally be accompanied by washes of the solid support to 
rinse library members having lower binding affinities off of the solid support. 

25 Alternatively, for a homogeneous liquid binding reaction admixture, a secondary binding means 
specific for the biological macromolecule can be utilized to bind the molecule and provide for its 
separation from the binding reaction admixture. 

For example, an immobilized antibody immunospecific for the biological macromolecule can be 
30 provided as a solid phase-affixed antibody to the binding reaction admixture after the binding 
reaction complex is formed. The immobilized antibody immunoreacts with the biological 
macromolecule present in the binding reaction admixture to form an antibody-biological 
macromolecule immunoreaction complex. Thereafter, by separation of the solid phase from the 
binding reaction admixture, the immunoreaction complex, and therefor any binding reaction 
35 complex, is separated from the admixture to form isolated library member. 
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Alternatively, a binding means can be operatively linked to the biological macromolecule to 
facilitate its retrieval from the binding reaction admixture. Exemplary binding means are one of 
the following high affinity pairs: biotin-avidin, protein A-Fc receptor, ferritin-magnetic beads, and 
the like. Thus, the biological macromolecule is operatively linked (conjugated) to biotin, protein 
5 A, ferritin and the like binding means, and the binding reaction complex is isolated by the use of 
the corresponding binding partner in the solid phase, e.g., solid-phase avidin, solid-phase Fc 
receptor, solid phase magnetic beads and the like. 

The use of solid supports on which to operatively link proteinaceous molecules is generally well 
10 known in the art. Useful solid support matrices are well known in the art and include cross- 
linked dextran such as that available under the tradename SEPHADEX from Pharmacia Fine 
Chemicals (Piscataway, N.J.); agarose, borosilicate, polystyrene or latex beads about 1 micron 
to about 5 millimeters in diameter, polyvinyl chloride, polystyrene, cross-linked polyacrylamide, 
nitrocellulose or nylon-based webs such as sheets, strips, paddles, plates microtiter plate wells 
15 and the like insoluble matrices. 

Determining the Identifier Sequence 

The DNA moieties of library members that bind to the target biological molecules are amplified 
by polymerase chain reaction (PCR), which allows very low amounts of template DNA to be 
20 detected. Subsequent decoding of the enriched DNA uses either DNA sequencing or 

hybridization to oligonucleotide microarrays, depending on the architecture of the library and its 
size. 

A preferred method for decoding is the use of high throughput sequencing methods, such as the 
25 454-Roche Genome Sequencer system. For sequencing with the 454-Roche Genome 
Sequencer system, PCR products have to contain suitable adaptor sequences at their 
extremities (calles adaptor sequence A and B), which can be either added after a PCR reaction 
by ligation, or they can be incorporated in the PCR reactions, if the PCR primers contain on 
their 5'-ends sequences corresponding to an adaptor region. For PCR amplification of the DNA 
30 moieties in the examples below, PCR primers DEL_P1_A (5'- GCC TCC CTC GCG CCA T CA 
GGG AGC TTG TGA ATT CTG G-3') and DEL_P2_B (5'- GCC TTG CCA GCC CGC TCA G GT 
AGT CGG ATC CGA CCA C-3') were used. The sequences that are not underlined, are 
complementary to one of the 2 strands of the PCR primer regions in the compounds. The 
underlined sequence of DEL_P1_A corresponds to the sequence of adaptor region A, and the 
35 underlined sequence of DEL-P2_B corresponds to adaptor sequence B. 
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The next step of a particular sequencing process is the annealing of PCR amplicons on DNA 
Capture Beads, emulsification of beads and PCR reagents in water-in-oil microreactors, and 
clonal emPCR amplification inside these microreactors. After breaking of the emulsion, the 
Capture beads are mixed with Enzyme Beads, and loaded on a PicoTiterPlate. Pyrosequencing 
5 allows the recording of individual sequences for each DNA species displayed at Capture Beads, 
trapped in the wells of PicoTiterPlates. This allows the parallel sequencing of a vast amount 
(typically more than 100,000 per PicoTiterPlate) of individual DNA species at a time. With 
further improvement of the sequencing technology, it will be possible to sequence more than 
1,000,000 individual DNA species at a time. 

10 

IgG binders 

By means of the work carried out by the inventors, which is described in the examples below, 
certain chemical moieties have been identified which bind to IgG. These moieties are those 
derived from: 




15 

Compound 02-40 and 16-40 once bound to a solid support via the acid group have been shown 
in the examples below to act as affinity chromatography resins for IgG Cy5 labelled. 

Accordingly a first further aspect of the present invention provides a compound selected from 
20 02-40, 16-40 02-69 and 18-69. These compounds may used as affintity ligands in an affinity 
chromatography resin for IgG. 

A second further aspect of the present invention provides an affinity chromatography resin for 
IgG comprising as the affinity ligand a moiety comprising a pair of constituent moieties being: 
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More preferably the pair of constituent moieties are: 




The constituent moieties are linked together and to the sold support by a chemical linking group. 
5 The chemical linking group may be of the formula: 

H 

wherein X is selected from O or CH 2 ; 
n is 0, 1 or 2; and 
m is 2, 3 or 4. 

In preferred embodiments of this aspect the affinity ligand is selected from 02-40, 16-40, 02-69 
and 18-69, and more preferably 02-40 and 16-40. 




A third further aspect of the invention relates to the use of the affinity chromatography resin of 
1 5 the second further aspect for the purification of IgG. 



The affinity chromatograpy resin used to immobilize the IgG binding moieties may comprise an 
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agarose carbohydrate polymer, for example, Sepharose Fast Flow. A highly cross-linked matrix 
makes the support more rigid which in turn improves the pressure/flow characteristic. In order to 
allow the biomolecules a rapid and unhindered access to the immobilized affinity ligand, the 
particle size needs to be no higher than 180 ^im and preferebly spherical in the range 45-165 
5 ^im. The swelling factor is preferably higher then 3 drained medium/g. Furthermore the resin 
needs a wide range of pH and chemical stability to prevent any side reactivity during the 
washing and elution in various conditions. 

In an embodiment of the invention, the resin support is Sepharose 4 Fast Flow resin (GE 
10 Healthcare) with a particle size 45-165 urn, swelling factor 4-5 drained medium/g and a pH 
stability range between 3-11. 

Examples 

General Experimental Methods 

15 Reagents Unless otherwise denoted, chemical compounds and proteins were from Sigma- 
Aldrich-Fluka (Buchs, Switzerland), resin for solid phase synthesis from Novabiochem 
(Laufelfingen, Switzerland), enzymes from New England Biolabs (Germany) and HPLC grade 
lyophilized oligonucleotides were from IBA GmbH (Gottingen Germany). SpinX columns were 
purchased from Corning Costar Incorporated (USA, cat.no 8162) and ion-exchange cartridges 

20 for DNA purification from Qiagen, (PCR purification cat.no 28104, Nucleotides removal cat.no 
28306) and used according to the protocol described by the provider. 

Example 1: Library Synthesis 

Summary 

25 A DNA-encoded chemical library was constructed consisting of 20 x 200 modules, joined 
together by the formation of an amide bond. Initially, 20 Fmoc-protected aminoacids were 
chemically coupled to 20 individual amino-tagged oligonucleotides. Following deprotection and 
HPLC purification, the 20 resulting DNA-encoded primary amines were coupled to 200 
carboxylic acids, in order to generate a library of 4000 members. In order to ensure that each 

30 library member contained a different DNA code, a split-and-pool strategy was chosen, which 
minimizes the number of oligonucleotides needed for library construction. The 20 Fmoc- 
protected aminoacids covalently linked to individual single-stranded oligonucleotides were 
mixed and aliquoted in 200 reaction vessels, prior to the coupling step with the 200 different 
carboxylic acids (one per well). The identity of the carboxylic acids used for the coupling 

35 reactions was encoded by performing an annealing step with individual oligonucleotides, 
partially complementary to the first oligo carrying the chemical modification. A successive 
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Klenow polymerization step yielded double stranded DNA fragments, each of which contained 
two identification codes (one corresponding to the initial 20 compounds and one corresponding 
to the 200 carboxylic acids). The 200 compound mixtures were then purified on an anion 
exchange cartridge and pooled. Model reactions performed prior to library construction had 
5 shown that the yields of the amide bond forming reaction ranged between 51% and 100%. The 
resulting DNA-encoded chemical library, containing 4'000 compounds, was dissolved in water, 
aliquoted at a total DNA concentration of 600 nM and stored at -20°C prior to further use. 

Detailed Information 

10 The individual organic compounds to be coupled to the 5' amino-modified 42-mer 

oligonucleotides were dissolved in DMSO stock solution (100mM), occasionally by further 
addition of water or diluted hydrochloric acid. All HPLC were performed on an XTerra Prep RP 18 
column (5pM, 10x150mm) using a linear gradient from 10% to 40% MeCN in 100mM TEAA, pH 
7. All the LC-ESI-MS were performed on an XTerra RP 18 column (5pM, 4.6x20mm) using a 

15 linear gradient from 0% to 50% MeOH over 1min in 400mM HFIP/5mM TEA. The mass spectra 
were measured from 900 to 2000 m/z by a Waters Quattro Micro instrument. 

Coupling reactions of 20 fmoc-protected aminoacids 

To a reaction volume of 310jjL, containing 70%(v/v) DMSO/water, compounds were added to 
20 the respective final concentrations: fmoc-protected-aminoacids DMSO solution, 30mM; N- 
hydroxysulfosuccinimide (Fluka,) in DMSO, 10mM; N-ethyl-N'-(3<limethylaminopropyl)- 
carbodiimide (Fluka) in DMSO, 4mM; triethylamine hydrochloride water solution, pH 9.0, 80mM; 
oligonucleotide (IBA) water solution, SOpM, (5'- 

GGAGCTTGTGAATTCTGGXXXXXXGGACGTGTGTGAATTGTC-3', XXXXXX unambiguously 
25 identifies the individual fmoc-protected aminoacid compounds). 

Table 1 lists the amino acids and the oligonucleotide code used to identify them: 



Table 1 



AA 


Structure 


Name 


Oligonucleotide Code 


1 


9 

.COOH 

FmocHN 


(S)-3-(((9H-fluoren-9- 
yl)methoxy)carbonylamino)-3- 
(pyridin-4-yl)propanoic acid 


A 


T 


C 


T 


T 


A 
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2 


\ 

^ \^ .NHFmoc 

\ / 

N COOH 


3-(((9H-fluoren-9- 
yl)methoxy)carbonylamino)-4-(4- 
bromophenyl)butanoic acid 


G 


C 


T 


G 


C 


G 


3 


COOH 

L 

V^i-^NHFmoc 


(1 R,2S)-2-(((9H-fluoren-9- 
yl)methoxy)carbonylamino)cyclo 
pentanecarboxylic acid 


A 


G 


A 


A 


C 


G 


4 


HOOC 

v ' NHFmoc 


3-(((9H-fluoren-9- 
yl)methoxy)carbonylamino)-3- 
(pyridin-2-yl)propanoic acid 


G 


A 


C 


A 


T 


C 


5 


NHFmoc 
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All coupling reactions were stirred overnight at room temperature; residual activated species 
were then quenched by addition of piperidine (Fluka) 500mM DMSO. Prior to HPLC purification 
500pL of 100mM TEAA, pH 7.0, was added to the reaction mixture. The reactions were then 
5 purified by HPLC and the desired samples were redissolved in 100pL of water and an amount 
of 10|jL was analyzed by LC-ESI-MS. The samples showed the expected desired fmoc- 
deprotected product. Typical coupling yields were >54% overall. 4.0 nmol of each DNA 
conjugated compound were pooled together to generate a 20 member DNA encoded sub- 
library. 

10 

Coupling reactions of 200 carboxylic acids 

To a reaction volume of 310pL, containing 70% (v/v) DMSO/water, compounds were added to 
the respective final concentrations: DMSO-dissolved carboxylic acid, 30mM; N- 
hydroxysulfosuccinimide (Fluka) in DMSO, 10mM; N-ethyl-N'-(3-dimethylaminopropyl)- 

15 carbodiimide (Fluka) in DMSO, 4mM; triethylaminehydrochloride, pH9.0 water solution, 80mM; 
DNA-oIigonucleotide sub-library pool, 1 .5pM. All coupling reactions were stirred overnight at 
room temperature; residual activated species were then quenched by addition of 50pL Tris-CI 
buffer, 500mM pH 9.0. The mixture was allowed to quantitatively precipitate by sequential 
addition of 25pL of 1M acetic acid, 12.5yL of 3M sodium acetate buffer, pH 4.7 and SOOpL 

20 ethanol followed by 2 hours incubation at -23°C. The mixture was centrifuged and the resulting 
oligonucleotide pellet was washed with ice-cold 90% (v/v) ethanol and then dissolved in lOOpL 
millipore water. Test coupling reactions were also performed with the reaction conditions 
described above; using model 42mer 5'-fmoc-deprotected aminoacids-oligonucleotide 
conjugate and model carboxylic acids. The reactions were analysed by HPLC and the masses 

25 of the reacted oligonucleotides detected by LC-ESI-MS. Typical HPLC coupling yields on this 
step were >58% with purity >46%. 

Polymerase Klenow encoding of 200 carboxylic acids reactions 

To a reaction volume of 50pL, reagents were added to the respective final concentrations: pool 
30 of compounds 320nM, 44mer oligonucleotide (5- 

GTAGTCGGATCCGACCACXXXXXXXXGACAATTCACACACGTCC-3', XXXXXXXX 
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unambiguously identifies the individual carboxylic acid compound, IBA) 600nM, Klenow buffer 
(NEB, cat.no B7002S), dNTPs (Roche, cat.no 11969064001), 0.5mM, Klenow Polymerase 
enzyme (NEB, cat.no M0210L), 5 units. All the Klenow polymerization were incubated at 37°C 
for 1 hour and then purified on ion-exchange cartridge (Qiagen, cat.no 28306). All the 200 
5 purified reactions were dissolved in 50pL of water and pooled to generate the 4000 member 
library (DEL4000) to a final oligonucleotide concentration of 300 nM. 

Table 2 lists the carboxylic acids and the oligonucleotide code used to identify them: 
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Preparation of the D-desthiobiotin oligonucleotide-conjugate (positive control) 
The D-desthiobiotin-oligonucleotide-conjugate was synthesized and unambiguously encoded as 
5 described above. Prior Polymerase Klenow reaction, HPLC purification was performed and LC- 
ESI-MS showed the expected D-desthiobiotin oligonucleotide conjugated mass. 

Example 2: Library Screening against streptavidin 

Summary 

10 Selections on streptavidin-sepharose were performed using experimental panning conditions 
essentially identical to the ones previously described by some of the inventors with a smaller 
single-pharmacophore DNA-encoded chemical library (Dumelin C.E., etal. t Bioconjug Chem., 
17(2), 366-370 (2006). 

15 Details 

The library of Example 1, 300nM, was diluted 1:30 in PBS, spiked with D-desthiobiotin 
oligonucleotide-conjugate (final concentration 10pM) and 100pL was added to 50pl of 
streptavidin-sepharose slurry (GE Healthcare, cat.no 17-51 13-01) or of sepharose slurry without 
streptavidin preincubated with PBS, 0.3mg/mL herring sperm DNA (Sigma). After incubation for 
20 1 hour at 25°C the mixture was transferred to a SpinX column (Corning Costar Incorporated), 
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the supernatant removed, and the resin washed 4x400pL of PBS. At last the resin was 
resuspended in 100pL of water. 

Identification of binding molecules 
5 The codes of the oligonucleotide-compound conjugates were amplified before and after 

selection by PCR (50pL, 25 cycles of 1min at 94°C, 1min at 55°C, 40s at 72°C) either of 5^L of 
the resuspended resin or of 5pL of 100fM DEL4000 using primers DEL_P1_A (5'- 
GCCTCCCTCGCGCCATCAGGGAGCTTGTGAATTCTGG-3') and DEL_P2_B (5'- 
GCCTTGCCAGCCCGCTCAGGTAGTCGGATCCGACCAC-3 , ) that additionally contain at one 
10 extremity a specific domain (19 nucleotide) required for high-throughput sequencing. The PCR 
product was purified on ion-exchange cartridge (Qiagen, 28104). Subsequent high-throughput 
sequencing was performed on a 454 Life Sciences-Roche GS FLX Sequencer platform. 

For each high-throughput sequencing experiments, an average of 71 16 sequences have been 
15 evaluated and plotted on 3D maps, as shown in Figure 4. This shows the results of the high- 
throughput sequence analysis performed on the library before selection (Figure 4a), on fractions 
eluted from an unmodified sepharose resin used as negative control (Figure 4b), or from 
streptavidin-coated sepharose (Figure 4c). 

20 As expected, compounds in the library prior to selection were found to be represented in 
comparable amounts. When sequencing 1 1,933 individual codes from the library before 
selection, the average counts and the standard deviations for the 4,000 compounds were found 
to be 1 .72 ± 1 .42. Similarly, no striking enrichment was observed for selections on empty resin, 
even though some compounds displayed a 4-5-fold enrichment as a result of the panning 

25 procedure. By contrast, the decoding of the streptavidin selection revealed a preferential 
enrichment of certain classes of structurally-related compounds. In addition to desthiobiotin, 
which had been spiked into the library as positive control prior to selection, an enrichment of 
derivatives of the thioester moiety 78, of the ester moiety 49, as well as of pharmacophores 54, 
73, 175 and 188 was observed. Fluorescent amide derivatives of compounds 49 and 78 had 

30 previously been found to bind to streptavidin with dissociation constants in the ~1 mM range, as 
assessed by fluorescence polarization assays (Dumelin, et a/., 2006), while 54, 73, 175 and 188 
had not previously been reported as streptavidin binders. 

Example 3: Resynthesis of binding molecules to assess dissociation constants 
35 In order to assess whether the extension of the pharmacophore 49 and 78 moieties within the 
4,000-membered chemical library (see Figure 4) would contribute to an increased affinity 



WO 2009/077173 



PCT/EP2008/010770 



72 

towards streptavidin, the dissociation constants of the most relevant compounds were 
measured, following conjugation to fluorescein. 

Synthesis 

5 In a on-resin peptide synthesis syringe with 50mg (46pmol) of 0-bis-(aminoethyl)ethylene glycol 
trityl resin (Novabiochem, cat.no 01-64-0235) were introduced in dry DMF solution the 
appropriate fmoc-protected aminoacid (100pmol, 1mL), HBTU (Aldrich, 200Mmol, 1mL), DIEA 
(Fluka, 400|jmol l 0.5mL). After overnight incubation at room temperature, the resin was washed 
6x2mL dry DMF and the fmoc moiety removed by addition of 1mL piperidine (Fluka) 50% dry 

10 DMF for 1 hour at room temperature. Following washing 6x2mL dry DMF, the corresponding 
carboxylic acid (100pmol, 1mL DMF) was introduced and a further amide bond formation 
reaction performed as described above. The resulting product was cleaved treating the resin 
with 10x2mL TFA 1% in CH 2 CI 2 . The DCM fractions were quenched in 5mL NaHC0 3 aq sat and 
the water phase back extracted 2x5mL CH 2 CI 2 . All the organic phases collected were washed 

15 with water until pH 7.0, dried on Na 2 S0 4 and concentered in vacuo. The crude product was 
treated in the dark overnight at room temperature with 2 equivalents of fluorescein 
isothiocyanate (FITC, Aldrich) in 800pl_ of DMF and 200pL NaHC0 3 aq sa t. Following HPLC 
purification on an XTerra Prep RP 18 column (5pM, 10x1 50mm) using a linear gradient from 10 % 
to 100% MeCN 0.1% TFA, the desired fractions were collected and lyophilized. ESI-MS 

20 analysis confirmed the mass of the expected FITC conjugated products. 02-78 

(C45H49BrN4O10S2) expected: 949.93 measured: 951.31 [M+hT]; 07-78 (C50H59N5O12S2) 
expected: 985.36 measured: 986.37 [M+hT]; 15-78 (C44H49N5012S3) expected: 935.25 
measured: 936.25 [M+hT]; 02-107 (C47H47BrN409S) expected: 923.87 measured: 925.12 
[M+H + ]; 13-40 (C46H49N5014S) expected: 927.30 measured: 928.42 [M+hT]; 11-78 

25 (C49H58N4013S2) expected: 974.34 measured: 975.41; 17-49 (C45H49IN401 1S) expected: 
980.22 measured: 981.29 [M+H+]; 17-78 (C45H49IN4O10S2) expected: 996.19 measured: 
997.26; 16-78 (C43H48N4O10S3) expected: 876.25 measured: 877.33; 15-117 
(C45H45N5012S2) expected: 911.25 measured: 912.33; 02-49 (C45H49BrN4011S) expected: 
932.23; measured: 933.32 [M+hT]. 

30 

Affinity measurements 

Fluorescein-compound conjugates (500nM) were incubated with increasing amounts of 
streptavidin (BIOSPA, cat.no S002-6) in PBS, 5% DMSO, for 1 hour at 25°C. The fluorescent 
polarization was determined with a TECAN Polarion instrument by excitation at 485nm and 
35 measurement at 535nm (e = 72.000 M" 1 cm" 1 ). 
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Results 

As expected, compound 02-78, which displayed the strongest enrichment following streptavidin 
selection, revealed the lowest dissociation constant (Kd = 385 nM). Replacement of the p- 
bromophenyl moiety 02 with compound 07 or with the sulfonamide 15 led to a progressive loss 
5 of enrichment in the selection, which was paralleled by a loss of binding affinities (Kd = 1 1 

and 78 \xM t respectively). The p-bromophenyl moiety 02, which contributes to the binding affinity 
of compound 02-78, was not able to yield high streptavidin binding affinities when used in the 
absence of pharmacophore 78, as revealed by the low enrichment profiles of 02-containing 
compoundsand by the large dissociation constants of compounds in the series (e.g., Kd = 50 
10 \iM for 02-107). 



Compound 


Ko[nM] 


02-78 


0.385 


07-78 


11 


15-78 


78 


02-107 


50 



Example 4: Library screening against IgG 

Summary 

In a similar manner to Example 2, selections against IgG-sepharose were performed. 

15 

Details 

Polyclonal human IgG coating of sepharose beads 

100mg CNBr-activated sepharose (GE Healthcare, Piscataway, NJ) was swollen in 1 mM HCI, 
washed, and mixed in separate tubes with 2.5 mg/ml polyclonal human IgG (Sigma-Aldrich- 
20 Fluka, Buchs, Switzerland) dissolved in 0.1 M Tris-CI, 0.5 M NaCI, pH 8.3. After 4 hour 

incubation at 4°C, the slurry was repeatedly washed and stored in 0.1 M NaAc, 0.5 M NaCI, pH 
4 at 4°C. 

Library Screening 

25 The library of example 1 (total oligonucleotide conjugate concentration 300nM) was diluted 1 :30 
in PBS. 100pL of the library was either added to 50pl IgG-sepharose slurry or to sepharose 
slurry without streptavidin. Both resins were preincubated with PBS, 0.3mg/mL herring sperm 
DNA (Sigma). After incubation for 1 hour at 25°C the mixture was transferred to a SpinX column 
(Corning Costar Incorporated), the supernatant was removed, and the resin washed 4x with 

30 400|jL PBS. After washing, the resin was resuspended in 100pL water. 
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Identification of binding molecules 

The codes of the oligonucleotide-compound conjugates were amplified by PCR (50pL, 25 
cycles of 1 minute at 94°C, 1 minute at 55°C, 40 seconds at 72°C) with either 5pL of 100fM 
DEL4000 library before selection as template, or 5pL of each resuspended resin after selection 
5 as template. The PCR primers DEL_P1_A (5 - GCC TCC CTC GCG CCA TCA G GG AGC TTG 
TGA ATT CTG G-3') and DEL_P2_B (5 - GCC TTG CCA GCC CGC TCA G GT AGT CGG ATC 
CGA CCA C-3') additionally contain at one extremity a 19 nucleotide domain (underlined) 
required for high-throughput sequencing with the 454 Genome Sequencer system. The PCR 
products were purified on ion-exchange cartridges (Qiagen, 28104). Subsequent high- 
10 throughput sequencing was performed on a 454 Life Sciences-Roche GS Fix Sequencer 
platform. 

For each high-throughput sequencing experiments 39.092 sequences have been evaluated and 
plotted on 3D maps, as shown in Figure 5. This shows the results of the high-throughput 
1 5 sequence analysis performed on the library before selection, on fractions eluted from an 

unmodified sepharose beads used as negative control, or from IgG-coated sepharose beads. 

Results 

The decoding of the polyclonal IgG selection showed a preferential enrichment of certain 
20 classes of structurally-related compounds. Typically an enrichment of the derivatives of 

compound 40 as well as of the pharmacophore 69 was observed. Additionally the derivatives of 
the bromide 02 and of thiophene 16 revealed an exceptional enrichement in combination both 
with the moiety 40 and the compound 69. 

25 Example 5: Resynthesis of binding molecules to assess dissociation constants 

In order to assess the dissociation constant of the most relevant compounds selected (see 
Figure 5), fluorescent polarization was measured, following resynthesis of the compounds and 
conjugation to fluorescein. 

30 Details 

Synthesis of the binding molecules as fluorescein conjugates 

In a polypropylene syringe, 50mg (46pmol) of 0-bis-(aminoethyl)ethylene glycol trityl resin 
(Novabiochem, cat.no 01-64-0235) was suspended in a mixture of the appropriate fmoc- 
protected amino acid (100pmol , 1mL), HBTU (Aldrich, 200Mmol , 1mL), and DIEA (Fluka, 
35 400pmol , 0.5mL) in dry DMF. After overnight incubation at 25°C, the resin was washed 6x with 
2mL dry DMF and the fmoc moiety was removed by addition of 1mL piperidine (50% in dry 
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DMF) for 1 hour at 25°C. After washing 6x with 2mL dry DMF, the corresponding carboxylic acid 
(lOOpmol, 1mL DMF) was added and a further amide bond formation reaction was performed 
as described above. The resulting product was cleaved by treating the resin with 10x with 2ml_ 
TFA (1% in CH 2 CI 2 ). The dichloromethylene fractions were quenched in 5mL NaHC0 3 aq sa t and 
5 the water phase was back extracted 2x with 5mL CH 2 CI 2 . The pooled organic phases were 
washed 3x with water, dried on Na 2 S0 4 and concentrated in vacuo. The crude product was 
reacted with 2 equivalents of fluorescein isothiocyanate (800pL of DMF) and 200|jL NaHCOs 
aq S atin the dark overnight at 25°C. Following HPLC purification on an XTerra Prep RP 18 column 
(5pM, 10x1 50mm) using a linear gradient from 10% to 100% MeCN 0.1% TFA, the desired 
10 fractions were collected and lyophilized. ESI-MS analysis confirmed the mass of the expected 
FITC conjugated products. 02-40 (C29H41BrN409) expected: 668.21 measured: 669.37 
[M+H + ]; 16-40 (C27H40N4O9S) expected: 596.25 measured: 597.12 [M+H + ]. 

Affinity measurements 

15 Fluorescein-compound conjugates (500 nM) were incubated with increasing amounts of 

polyclonal human IgG (Sigma-Aldrich-Fluka, Buchs, Switzerland) in PBS, 5% DMSO, for 1 hour 
at 25°C. The fluorescent polarization was determined with a TECAN Polarion instrument by 
excitation at 485nm and measuring emission at 535nm (e = 72000 M 1 cm" 1 ). 

20 The apparent association constant revealed was 200 pM. 

Example 6: Synthesis of IgG-binding resin 

In order to asses the performance of the compounds selected in affinity chromaphy purification 
of a human polyclonal IgG sample, the compounds selected were immobilized on a suitable 
25 affinity chromatograpy resin support and applied both in the affinity chromatograpy of a pure 
sample of polyclonal human IgG Cy5 labelled and in the purification of a crude sample of a 
CHO (Chinese Hamster Ovary) cells supernatant spiked with plyclonal human IgG Cy5 labelled. 

Synthesis of the binding molecules on affinity chromatography resin 
30 In a polypropylene syringe, 50mg (46pmol) of 0-bis-(aminoethyl)ethylene glycol trityl resin 
(Novabiochem, cat.no 01-64-0235) was suspended in a mixture of the appropriate fmoc- 
protected amino acid (100pmol, 1mL), HBTU (Aldrich, 200Mmol, 1mL), and DIEA (Fluka, 
400Mmol, 0.5mL) in dry DMF. After overnight incubation at 25°C, the resin was washed 6x with 
2ml_ dry DMF and the fmoc moiety was removed by addition of 1mL piperidine (50% in dry 
35 DMF) for 1h at 25°C. After washing 6x with 2mL dry DMF, the corresponding carboxylic acid 
(lOOjjmol, 1mL DMF) was added and a further amide bond formation reaction was performed 
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as described above. The resulting product was cleaved by treating the resin with 10x with 2mL 
TFA (1% in CH 2 CI 2 ). The dichloromethylene fractions were quenched in 5mL NaHC0 3 aq sa t and 
the water phase was back extracted 2x with 5mL CH 2 CI 2 . The pooled organic phases were 
washed 3x with water, dried on Na 2 S0 4 and concentrated in vacuo. The crude product was 
5 reacted with 2 equivalents of fluorescein isothiocyanate (800|jL of DMF) and 200pl_ NaHC0 3 
aqsatin the dark overnight at 25°C. Following HPLC purification on an XTerra Prep RPi 8 column 
(5pM, 10x1 50mm) using a linear gradient from 10% to 100% MeCN 0.1% TFA, the desired 
fractions were collected and lyophilized. ESI-MS analysis confirmed the mass of the expected 
products. 200mg CNBr-activated sepharose (GE Healthcare, Piscataway, NJ) was swollen in 1 
10 mM HCI, washed, and mixed in separate tubes with 15pmol of the compounds dissolved in 2mL 
0.1 M NaHC0 3aq , 10% DMF. After 4 hours incubation at 25°C, the slurry was repeatedly washed 
and stored in PBS at 4°C. 

Polyclonal human IgG Cy5 labeling 
15 Polyclonal human IgG (Sigma-Aldrich-Fluka, Buchs, Switzerland) was labelled with Cy5 Mono- 
reactive kit (Amersham, cat.no PA25001) according to the protocol of the provider and purified 
over a PD10 column (GE Healthcare, catno 17-0851-01) as described by the supplier. 

Affinity chromatography on IgG binding resin (Figure 6a) 

20 70mg of L2C2C40 or L2C16C40 resin were loaded on a chromatography cartridge (Glen 
Research, cat.no 20-0030-00) and washed 3x with 1ml_ PBS before loading Cy5 labelled 
polyclonal human IgG solution (216pL, 4.64|jM in PBS). The flow-through of the washing steps 
(washing Ixwith 1mL PBS; Ixwith 1mL 500mM NaCI, 0.5mM EDTA; Ixwith 1mL 100 mM 
NaCI, 0.1% Tween 20, 0.5 mM EDTA) and the elutate (elution 3x with 200pL aqueous 

25 triethylamine (TEA) 100mM) were collected and eventually concentrated back to a final volume 
of 200pL by centrifugation in a Vivaspin 500 tube (Vivascience, cat.no VS0101, cut-off 10.000 
MW). The samples were then analyzed by gel electrophoresis on a NuPAGE 4-12% Bis-Tris 
Gel (Invitrogen, cat.no NP0321) using MOPS SDS as running buffer and stained with 
Coomassie Blue. The results are shown in Figure 6a, wherein: 

30 M = Marker Rainbow Full range 

In = Polyclonal Human lgGCy5 2 nmol in 216 pL PBS 
W1 = 1 mL PBS 

W2 = 1 mL 500 mM NaCI, 0.5 mM EDTA 
W3 = 1 mL 100 mM NaCI, 0.1% Tween 20, 0.5 mM EDTA 
35 E1 = E2 = E3 = 200 pL 0.1 M TEA 
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Affinity chromatography of crude a CHO (Chinese Hamster Ovary) cells supernatant spiked with 
human IgG Cy5 labelled on IgG binding resin (Figure 6b) 

70mg of L2C2C40 or L2C16C40 resin were loaded on a chromatography cartridge (Glen 
Research, cat.no 20-0030-00) and washed 3x with 1mL PBS before loading a crude sample of 
5 CHO (Chinese Hamster Ovary) cells supernatant (60 jjL) spiked with human IgG Cy5 labelled 
(20 |jL, 9.68 pM). The flow-through, the washing steps (washing 1x with 10 ml_ PBS; 1x with 10 
mL 500 mM NaCI, 0.5 mM EDTA; 1x with 10 ml_ 100 mM NaCI, 0.1% Tween 20, 0.5 mM EDTA) 
and the elutate (elution 3x with 200 pL aqueous triethylamine 100 mM) were collected and 
eventually concentrated back to a final volume of 100pL by centrifugation in a Vivaspin 500 tube 
10 (Vivascience, cat.no VS0101, cut-off 10.000 MW). The samples were then analyzed by gel 
electrophoresis on a NuPAGE 4-12% Bis-Tris Gel (Invitrogen, cat.no NP0321) using MOPS 
SDS as running buffer and stained with Coomassie Blue. Cy5 activity was detected by a Diana 
III Chemiluminescence Detection System (Raytest) by excitation at 675 nm and measuring 
emission at 694 nm (e = 250,000 M' 1 cm" 1 ). The results are shown in Figure 6b, wherein: 
15 M = Marker Rainbow Full Range 

In = 60 pL supernatant + 20 lgGCy5 9.68 mM 

W = 10 mL PBS + 10 mL 500 mM NaCI, 0.5 mM EDTA + 

10 mL 100 mM NaCI, 0.1% Tween 20, 0.5 mM EDTA 
E1 = E2 = E3 = 200 pL 0.1 M TEA 
20 Ctrl(+) = Polyclonal Human IgG 

Results 

As it is shown in Figure 6, the human IgG Cy5 capture by the resin and subsequently elution is 
in both the experiments quantitative (Figure 6a and 6b). Additionally Figure 6b shows the high 
25 selectivity of the resin for the capture of human IgG Cy5 after extensive washing and elution. 

Example 7: Stepwise encoding 
Introduction 

Figure 7 shows three different experimental schemes, which could be conceived for the 
30 construction and encoding of a large library. The first scheme features the stepwise addition of 
groups of chemical moieties onto an initial scaffold, using suitable orthogonal chemical 
reactions and/or protecting strategies, followed by the sequential addition of the corresponding 
DNA codes by an iterative ligation procedure. This scheme, which corresponds broadly to 
known methods, is conceptually simple and can be implemented experimentally, but requires 2 
35 pre-annealed oligonucleotides for each encoding event (i.e., 200 + 200 + 200 oligonucleotides 
for a library containing 100 x 100 x 100 chemical groups; Figure 7a). The second synthetic and 
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encoding strategy represents an embodiment of the first aspect of the invention. This method 
uses Klenow polymerization, which would require the lowest number of oligonucleotides for 
library encoding (100 + 100+ 100 oligonucleotides for a library containing 100 x 100 x 100 
chemical groups; Figure 7b). Finally, an embodiment of the second aspect of the invention is 
5 shown/ Here, the double-stranded DNA fragment generated by Klenow filling is followed by a 
digestion with a non-palindromic cutter and by a subsequent ligation step with a complementary 
double-stranded DNA fragment (Figure 7c) 

The feasibility of the experimental procedures is demonstrated below by gel-electrophoretic 
10 analysis, which monitored the stepwise assembly of DNA-fragments of suitable size, and by 
DNA sequencing. 

General methods 

Gel electrophoresis was performed either using 15% Tris-Borate-EDTA-Urea denaturing 
15 polyacrylamide gels (TBE-Urea, Invitrogen, cat.no EC68852) or 20% Tris-Borate-EDTA native 
polyacrylamide gels (TBE, Invitrogen, cat.no EC63152) and stained with SYBRgreenll (Sigma, 
S9305). DNA Ethanol precipitation of DNA was performed by adding 1/10 volumes of 3M 
AcOH/AcONa buffer pH 4.7, and 3 volumes of ethanol relative to the volume of the DNA 
sample. After 2h incubation at -23°C the mixture was centrifuged in a table-top centrifuge for 
20 40min (16.000g) at 4°C, the supernatant removed and the pellet washed with 300 pL ice-cold 
ethanol 90%. After a further 20min centrifugation (16.000g) at 4°C, the pellet was dried and 
redissolved in water. 

Comparative example: Stepwise encoding by Ligation (Method a Underlined sequences 

25 represent coding sequences) 

Hybridization of 3 pairs (A, B, C) of oligonucleotides (A: 5-CAT GGA ATT CGC TCA CTC CGA 
CTA G AG G-3* and 5'-(Phosphate)-CGT ACC TCTAGTCGG AGT GAG CGA ATT CCA TG-3*; 
B: 5'-(Phosphate)-TAC GTG AGC TTG ACC TGG TGA G-3' and 5'- (Phosphate)-GCT TCT 
CAC CA G GTC AA G CTC A-3'; C: 5'-(Phosphate)-AAG C AC GTT C GC TGG ATC CTC AAC 

30 TGT G-3' and 5'-CAC AGT TGA GGA TCC AGC GAA CGT- 3', IBA) was carried out by mixing 
the oligonucleotides at a concentration of 1.25 pM per oligonucleotide in 1x ligase buffer (40 
mM Tris-HCI, 10 mM MgCI2, 10 mM DTT, 0.5 mM ATP, pH 7.8) and incubating the mixtures for 
10 minutes at 50°C. Subsequently the ligations were performed mixing 10 pi of hybridized 
oligonucleotide pairs A and B with 10 pi of 1x ligase buffer and 1 pL of T4 ligase (Roche Applied 

35 Science, Basel, Switzerland), and incubated at room temperature for 2 hours. The ligation 

product was purified using a Qiagen Nucleotide Removal Kit (Qiagen, cat.no 28306), and eluted 
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with 50 pi of 10 mM Tris-HCI pH 8.0. 18 pi of the eluate was mixed with 10 pi of hybridized 
oligonucleotide pair C (which was present in excess), 2 pi of 10x ligase buffer, and 1 pi of T4 
ligase, and incubated for 2 hours at room temperature. Aliquots of the two starting 
oligonucleotides, and the different ligation products were subjected to electrophoresis on a 20% 
5 TBE gel. 

Stepwise encoding by Klenow Polymerase (Method b Underlined sequences represent coding 
sequences) 

To a reaction volume of 50pL, reagents were added to the respective final concentrations: a 
10 42mer 5'-amino-C12-DNA-oligonucleotide (5'-GGA GCT TGT GAA TTC TGG ATC TTA GGA 
CGT GTG TGA ATT GTC-3', IBA), 2pM, 42mer 3'-C6-biotinylated-oligonucieotide (5'-GTA GTC 
GGA TCC GAC CAC GTT CCT GAG AAT TCA CAC ACG TCC-3', IBA), 3pM, Klenow buffer 
(NEB, cat.no B7002S), dNTPs (Roche, cat.no 11969064001), 0.5mM, Klenow Polymerase 
enzyme (NEB, cat.no M0210L), 5 units. The Klenow polymerization reaction was incubated at 
15 37°C for 1h, purified on ion-exchange cartridge (Qiagen, cat.no 28306) and eluted in 100pL of 
4M urea. After incubating at 94°C for 2min, 50pl of streptavidin-sepharose slurry (GE 
Healthcare, cat.no 17-51 13-01) were added and the slurry was incubated for 1h at 4°C. The 
streptavidin sepharose resin and the supernatant were separated by centrifugation in a SpinX 
column (Corning Costar Incorporated). The DNA in the supernatant was ethanol precipitated as 
20 described above. The resulting singele-stranded oligonucleotide was mixed with a 42mer 

unmodified DNA oligonucleotide (5'-GTC GTA TCG CCA TGG TCC AAC ATC GTA GTC GGA 
GAG GAC CAC-3') and a Klenow polymerization reaction was performed as described above. 
Aliquots of the three starting oligonucleotides, and the different Klenow products were applied 
on a 15% TBE-Urea gel. 

25 

Stepwise encoding by "Hybrid Klenow-Ligation" (Method c Underlined sequences represent 
coding sequences)) 

To a reaction volume of 50pL, reagents were added to the respective final concentrations: a 
42mer 5'-amino-C12-DNA-oligonucleotide (5-GGA GCT TGT GAA TTC TGG ATC TTA GGA 

30 CGT GTG TGA ATT GTC-3', IBA), 2pM, a 42mer 5'-C6-biotinylated-oligonucleotide containing 
the non-palindromic BssSI restriction site (in boldface type) (5'-GTA GTC GGA CAC GAG TAC 
TGG TAA TC G ACA ATT CAC ACA CGT CC-3\ IBA), 3pM, klenow buffer (NEB, cat.no 
B7002S), dNTPs (Roche, cat.no 11969064001), 0.5mM, and Klenow Polymerase enzyme 
(NEB, cat.no M0210L), 5 units. After incubation at 37°C for 1h, the reaction mixture was purified 

35 on ion-exchange cartridge (Qiagen, cat.no 28306) and eluted in 25 pL of water. 8 units of BssSI 
enzyme were added to the purified Klenow product in 50pL of BssSI restriction buffer (NEB, 
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cat.no B7003S) . The restriction cutting reaction was carried out at 37°C for 1.5h. 50pL of 
streptavidin-sepharose slurry (GE Healthcare, cat.no 17-5113-01) was added and the slurry 
was incubated for 30min at 4°C. After SpinX column centrifugation (Corning Costar 
Incorporated), the supernatant was collected and purified on ion-exchange cartridge (Qiagen, 
5 cat.no 28306) and eluted in 25 pL of water. Subsequently we added the following reagents to 
the final volume of 50mL: preincubated mixture 1:1 of hybridized oligonucleotides (27mer 5- 
pyrophosphate-TCG TG A AAT TT G CTA GGA TCC ATA TTG-3* and 23mer 5 -CAA TAT GGA 
TCC TAG C AA ATT T C-3\ IBA), 3|jM, T4 ligase buffer (Roche Applied Science, Basel, 
Switzerland) and T4 ligase (Roche Applied Science, Basel, Switzerland), 4 units. The ligation 
10 was performed overnight at 16°C then purified on ion-exchange cartridge (Qiagen, cat.no 
28306). Aliquots of the starting oligonucleotides, and the different Klenow, restriction and 
ligation products were analyzed on a 15% TBE-Urea gel. 

Results 

15 The three methods described above allow to succesfully complete the stepwise encoding (see 
gels Figure 7). Compared with the ligation method (method a, Figure 7), the Klenow 
Polymerase and the hybrid Klenow-ligation methods allow the same stepwise encoding 
minimizing the number of oligonucleotides required (method b and c, Figure 7). 

20 Example 8 

Introduction 

Figure 8a shows a further experimental scheme, which could be conceived for the construction 
and encoding of a large library according to the third aspect of the invention. 

25 The feasibility of the experimental procedures is demonstrated below by gel-electrophoretic 
analysis, which monitored the stepwise assembly of DNA-fragments of suitable size, and by 
DNA sequencing. The analysis was carried out using the same technique as described in 
Example 7. 

30 Stepwise encoding by "Klenow-Ligation" (Underlined sequences represent coding sequences) 
To a reaction volume of 50pL, reagents were added to the respective final concentrations: a 
42mer 5'-amino-C12-DNA-oligonucleotide (5'-GGA GCT TGT GAA TTC TGG ATC TTA GGA 
CGT GTG TGA ATT GTC-3', IBA), 320nM, a 44mer oligonucleotide containing the restriction 
site BamHI (boldface type) (5'-GTA GTC GGA TCC GAC CAC GCA TAT AAG ACA ATT CAC 

35 ACA CGT CC-3'.IBA), 600nM, klenow buffer (NEB, cat.no B7002S), dNTPs (Roche, catno 
1 1969064001), O.SmM, and Klenow Polymerase enzyme (NEB, cat.no M0210L), 5 units. After 
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incubation at 37°C for 1 hour, the reaction mixture was purified on ion exchange cartridge 
(Qiagen, cat.no 28306) and eluted in 33 of water. 10 units of BamHI enzyme were added to 
the purified Klenow product (100nM) in 50pL of BamHI restriction buffer (NEB, cat.no B7003S) 
containing 0.1mg/ml_ BSA. The restriction cutting reaction was carried out at 37°C for 12 hours 
5 and purified on ion-exchange cartridge (Qiagen, cat.no 28306) and eluted in 50 pL of water. 
Subsequently the following reagents were added to a reaction volume of 25pL: purified 
restriction product (20nM), preincubated mixture 1:1 of hybridized oligonucleotides (43mer 5 - 
pyrophosphate-GAT CCG CTG CCC CGC CAA AAA AAA GAT CTG ATG GCG CGA GGG A - 
3' and 12mer 5 - GGC GGG GCA GCG \ Sigma), 100nM, T4 ligase buffer (NEB) and 1 \jL T4 
10 ligase (NEB, cat.no M0202L). The ligation was performed overnight at 16°C and then purified 
on ion exchange cartridge (Qiagen, cat.no28306). The final product was PCR amplified using 
the primers 5'-GGA GCT TGT GAA TTC TGG-3 and 5 X -GCC TCC CTC GCG CCA TCA GAT 
C-3* and analysed by subcloning and sequencing. 

15 Aliquots of the starting oligonucleotides, and the different Klenow, restriction and ligation 

products were analyzed on a 20% TBE gel (Figure 8b), wherein the reference numerals relate 
to Figure 8a. 
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Claims 

1 . A method of preparing a DNA-encoded chemical library comprising members which 
comprise a chemical moiety, a linking moiety and a DNA moiety, comprising for each member 
5 the steps of: 

(I) coupling a initial building block to a initial coding single strand oligomer via a linking 
moiety to form a initial conjugate, the initial coding single strand oligomer comprising a first 
primer region, an initial coding region and a first annealing region; 

(II) optionally coupling a middle building block and middle coding single strand oligomer to 
10 said initial conjugate to form a middle conjugate, the coupling take place in either order, 

wherein: 

(a) the middle building block is coupled to the residue of the initial building block; and 

(b) the middle coding single strand oligomer comprises a middle coding region and second 
annealing region, and is coupled by: 

15 (i) annealing a complementary single strand oligomer which comprises a chemical modifier, 
a complementary first annealing region, a complementary middle coding region and a 
complementary second annealing region by interaction between the first annealing region and 
the complementary first annealing region; 

(ii) treating the conjugate formed with a DNA polymerase to elongate the initial coding 

20 single strand oligomer to be complementary to the complementary middle coding region and the 
complementary second annealing region; 

(iii) removing the complementary single strand oligomer by denaturing the DNA and 
capturing the complementary single strand oligomer via the chemical modifier; 

(III) coupling a final building block and final coding single strand oligomer to the initial or 
25 middle conjugate as appropriate to form a final conjugate, the coupling take place in either 

order, wherein: 

(a) the final building block is coupled to the residue of the initial building block, or may be 
additionally or alternatively be coupled to the residue of the middle building block (if present); 
and 

30 (b) the final coding single strand oligomer comprises a final coding region and a second 
primer region, and is coupled by: 

(i) annealing a complementary single strand oligomer which comprises a complementary 
first or second annealing region as appropriate, a complementary final coding region and a 
complementary second primer region, by interaction between the first or second annealing 

35 region and the complementary first or second annealing region, as appropriate; 

(ii) treating the conjugate formed with a DNA polymerase to elongate the initial coding 
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single strand oligomer to be complementary to the complementary final coding region and the 
complementary second primer region, and to elongate the complementary coding strand to be 
complementary to the initial coding single strand oligomer. 

5 2. A method of preparing a DNA-encoded chemical library comprising a chemical moiety, a 
linking moiety and a DNA moiety, comprising for each member the steps of: 
(I) coupling a initial building block to a initial coding single strand oligomer via a linking 
moiety to form a initial conjugate, the initial coding single strand oligomer comprising a primer 
region, an initial coding region and an annealing region; 
10 (II) coupling a middle building block and middle coding single strand oligomer to said initial 
conjugate to form a middle conjugate, the coupling take place in either order, wherein: 

(a) the middle building block is coupled to the residue of the initial building block; and 

(b) the middle coding single strand oligomer comprises a middle coding region and a 
restriction region, and is coupled by: 

15 (i) annealing a complementary single strand oligomer which comprises a complementary 
annealing region, a complementary middle coding region, a complementary restriction region 
and a chemical modifier, by interaction between the annealing region and the complementary 
annealing region; 

(ii) treating the conjugate formed with a DNA polymerase to elongate the initial coding 

20 single strand oligomer to be complementary to the a complementary middle coding region and 
the complementary restriction region and to elongate the complementary single strand oligomer 
to be complementary to the initial primer region and initial coding region; and 

(iii) treating the conjugate with a non-palindromic restriction enzyme which acts on the 
restriction region to leave a middle conjugate, and capturing the restriction product via the 

25 chemical modifier; 

(III) coupling a final building block and final coding single strand oligomer to the middle 
conjugate as appropriate to form a final conjugate, the coupling take place in either order, 
wherein: 

(a) the final building block is coupled to the residue of the initial building block and/or to the 
30 residue of the middle building block; and 

(b) the final coding single strand oligomer comprising a final coding region and a second 
primer region, and is coupled by ligating a terminal double-stranded portion comprising a 
ligation region that matches the remainder of the restriction region site of the middle conjugate, 
and a final coding region and second primer region, as well as the complementary regions in 

35 the complementary strand. 
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3. A method of preparing a DNA-encoded chemical library comprising members which 
comprise a chemical moiety, a linking moiety and a DNA moiety, comprising for each member 
the steps of: 

(I) coupling a initial building block to a initial coding single strand oligomer via a linking 
5 moiety to form a initial conjugate, the initial coding single strand oligomer comprising a first 

primer region, an initial coding region and a first annealing region; 

(II) coupling a middle building block and middle coding single strand oligomer to said initial 
conjugate to form a middle conjugate, the coupling take place in either order, wherein: 

(a) the middle building block is coupled to the residue of the initial building block; and 
10 (b) the middle coding single strand oligomer comprises a middle coding region and a 
restriction region, and is coupled by: 

(i) annealing a complementary single strand oligomer which comprises a complementary 
first annealing region, a complementary middle coding region and a complementary restriction 
region by interaction between the first annealing region and the complementary first annealing 

15 region; 

(ii) treating the conjugate formed with a DNA polymerase to elongate the initial coding 
single strand oligomer to be complementary to the complementary middle coding region and the 
complementary restriction region; 

(iii) treating the conjugate with a restriction enzyme which acts on the restriction region to 
20 leave a middle conjugate, and removing the restriction product with a chromatographic method; 

(III) coupling a final building block and final coding partially double stranded oligomer to the 
middle conjugate to form a final conjugate, the coupling take place in either order, wherein: 
(a) the final building block is coupled to the residue of the initial building block, and 
additionally or alternatively be coupled to the residue of the middle building block; and 

25 (b) the final coding partially double stranded oligomer comprises a final coding region, a 
second primer region, and a double stranded ligation region that matches the remainder of the 
restriction region site of the middle conjugate, and is coupled by ligation using an excess of the 
final coding partially double stranded oligomer. 

30 4. A method according to any one of claims 1 to 3, wherein the individual method steps are 
carried out at about the same time for each library member. 

5. A method according to any one of claims 1 to 4, wherein the relative amounts of the 
individual members within the library varies from about 0.2 equivalents to about 10 equivalents. 

35 

6. A method according to any one of claims 1 to 5, wherein the building blocks are selected 
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from compounds having one or more of the following moieties: amine, carboxylic acid, ester, 
acid anhydride, acid halide, a double bond conjugated to an electron withdrawing group, thiol, 
alcohol, enolate, azide, alkyne, hydroxy, halogen, OTf, ONf, OTs, ketone and boron containing 
moiety. 

5 

7. A method according to claim 6, wherein the initial building block comprises a protected 
amine group which is deprotected before being coupled to a middle or final building block as 
appropriate. 

10 8. A method according to any one of claims 1 to 7, wherein the linker moiety has a chain 
length of 1 to 20 atoms. 

9. A method according to any one of claims 1 to 8, wherein the first and second primer 
moieties comprise between 16 and 28 nucleotides. 

15 

10. A method according to any one of claims 1 to 9, wherein the annealing regions comprise 
between 10 and 28 nucleotides. 

11. A method according to any one of claims 1 to 10, wherein the chemical modifier, if 
20 present, is a biotin or imminobiotin moiety. 

12. A DNA-encoded chemical library synthesised by a method according to any one of 
claims 1 to 1 1 . 

25 13. A DNA-encoded chemical library comprising members which comprise a chemical 

moiety, a linking moiety and a DNA moiety, wherein the chemical moiety is the reaction product 
of an initial building block and a final building block, the initial building blocks being selected 
from table 1 and the final building blocks being selected from table 2 where the library 
comprises members representing the reaction product of at least three initial building blocks and 

30 at least 5 final building blocks, and the DNA moiety comprises a first primer region, an initial 
coding region, a first annealing region, a final coding region and a second primer region, 

14. A DNA-encoded chemical library according to claim 13, wherein the linking moiety is a 
5'-amino modifier C12. 

35 

15. A DNA-encoded chemical library according to either claim 13 or claim 14, wherein the 
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initial coding region comprises 6 base pairs and the final coding region comprises 8 base pairs. 

16 A DNA-encoded chemical library according to any one of claims 13 to 15, wherein the 
first annealing region comprises 18 base pairs. 

5 

17. A DNA-encoded chemical library according to any one of claims 13 to 16, wherein the 
first and second primer regions are usuable with the PCR primers DEL_P1_A and DEL_P2_B 
respectively. 

10 18. A compound selected from: 




19. Use of a compound according to claim 18 as an affinity ligand in an affinity 
chromatography resin for IgG. 

15 20. An affinity chromatography resin for IgG comprising as the affinity ligand a moiety 
comprising a pair of constituent moieties being: 



a) 



b) 



c) 




WO 2009/077173 



PCT/EP2008/010770 



87 



d) 




21. An affinity chromatography resin for IgG according to claim 20, wherein the the pair of 
constituent moieties are: 



a) 



b) 




5 22. An affinity chromatography resin for IgG according to either claim 20 or claim 21 , 
wherein 

the constituent moieties are linked together and to the sold support by a chemical linking group, 
which is of the formula: 




10 wherein X is selected from O or CH 2 ; 
n is 0, 1 or 2; and 
m is 2, 3 or 4. 

23. An affinity chromatography resin for IgG according to any one of claims 20 to 22, 
15 wherein the affinity ligand is selected from: 




16-40 
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24. The use of the affinity chromatography resin according to any one of claims 20 to 23 for 
the purification of IgG. 
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